SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: ANDERSON, Darrell R.
HANNA, Nabil
LEONARD, John E.
NEWMAN, Roland A.
REFF, Mitchell E.
RASTETTER, William H.

(ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF CHIMERIC AND RADIOLABELED ANTIBODIES TO HUMAN B LYMPHOCYTE RESTRICTED DIFFERENTIATION ANTIGEN FOR THE TREATMENT OF B CELL LYMPHOMA

(iii) NUMBER OF SEQUENCES: 9

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BURNS, DOANE, SWECKER & MATHIS

(B) STREET: P.O. Box 1404

(C) CITY: Alexandria

(D) STATE: Virginia

(E) COUNTRY: United States 

(F) ZIP: 22313-1404 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US 08/149,099

(B) FILING DATE: 03-NOV-1993

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/978,891 

(B) FILING DATE: 12-NOV-1992

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Teskin, Robin L.

(B) REGISTRATION NUMBER: 35,030

(C) REFERENCE/DOCKET NUMBER: 012712-014

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (703) 836-5520

(B) TELEFAX: (703) 836-2021



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GGGAGCTTGG ATCGATCCTC TATGGTT  27
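The declared LENGTH of a record can be cross-checked against the residues actually listed. The following is a minimal sketch in Python and is not part of the original listing; the helper name `count_bases` is hypothetical, and it assumes the residues are written in 10-base groups separated by spaces, as in SEQ ID NO:1.

```python
def count_bases(listing_text: str) -> int:
    """Count nucleotide symbols (A, C, G, T, N), ignoring spaces and digits."""
    return sum(1 for ch in listing_text.upper() if ch in "ACGTN")


# SEQ ID NO:1 as given in the listing, with its declared length.
seq_id_1 = "GGGAGCTTGG ATCGATCCTC TATGGTT"
declared_length = 27

assert count_bases(seq_id_1) == declared_length
```

The same check can be applied row by row to the longer records, where each full row should contribute 60 bases.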

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG  60
AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC  120
GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT  180
ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG  240
GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT  300
GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT  360
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC  420
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG  480
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA  540
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA  600
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC  660
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC  720
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA  780
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG  840
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA  900
CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC  960
CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG  1020
CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA  1080
TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG  1140
TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC  1200
CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC  1260
AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC  1320
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG  1380
TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC  1440
GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT  1500
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC  1560



TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT  1620
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT  1680
GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG  1740
ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT  1800
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT  1860
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC  1920
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC  1980
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA  2040
TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC  2100
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC  2160
AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG  2220
GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA  2280
GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC  2340
TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC  2400
AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGCAGAGCC CAAATCTTGT  2460
GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC  2520
TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA  2580
TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC  2640
GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC  2700
CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGACTACAAG  2760
TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA  2820
GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAGG  2880
AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG  2940
TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC  3000
GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG  3060
AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC  3120
CTCTCCCTGT CTCCGGGTAA ATGAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT  3180
TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT  3240
GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC  3300
CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT  3360
CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA  3420
GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT  3480
CCCGATCCCC AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT  3540
AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG  3600
AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA  3660
GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA  3720
GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG  3780
GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT  3840
CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC  3900
CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG  3960
GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA  4020
GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC  4080
TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT  4140
CCTGAGAACA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA  4200
GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT  4260
GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT  4320
TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG  4380
GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA  4440
GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA  4500
GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG  4560
TCATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC  4620
TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG  4680
TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG  4740
GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA  4800
CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC  4860
TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC  4920
ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT  4980
TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA  5040
GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC  5100
CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC  5160
AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG  5220
TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC  5280
TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG  5340
CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC  5400
CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC  5460
CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG  5520
GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA  5580
GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG  5640
CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG  5700
TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT  5760
CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC  5820
CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG  5880
GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA  5940
GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTTCCCGATT  6000
CGCAGCGCAT CGCCTTCTAT CGCCTTCTTG ACGAGTTCTT CTGAGCGGGA CTCTGGGGTT  6060
CGAAATGACC GACCAAGCGA CGCCCAACCT GCCATCACGA GATTTCGATT CCACCGCCGC  6120
CTTCTATGAA AGGTTGGGCT TCGGAATCGT TTTCCGGGAC GCCGGCTGGA TGATCCTCCA  6180
GCGCGGGGAT CTCATGCTGG AGTTCTTCGC CCACCCCAAC TTGTTTATTG CAGCTTATAA  6240
TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT AAAGCATTTT TTTCACTGCA  6300
TTCTAGTTGT GGTTTGTCCA AACTCATCAA TCTATCTTAT CATGTCTGGA TCGCGGCCGC  6360
GATCCCGTCG AGAGCTTGGC GTAATCATGG TCATAGCTGT TTCCTGTGTG AAATTGTTAT  6420
CCGCTCACAA TTCCACACAA CATACGAGCC GGAGCATAAA GTGTAAAGCC TGGGGTGCCT  6480
AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA  6540
ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA  6600
TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC  6660
GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG  6720
CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT  6780
TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA  6840
GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT  6900
CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC  6960
CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG  7020
TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT  7080
TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG  7140
CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA  7200
AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA  7260
AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG  7320
GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG  7380
AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG  7440
GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT  7500
GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT  7560
TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC  7620
TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA  7680
TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG  7740
GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT  7800
GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA  7860
TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT  7920
CCCAACGATC AAGGCGAGTT ACATGATCCC CGATGTTGTG CAAAAAAGCG GTTAGCTCCT  7980
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG  8040
CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG  8100
AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG  8160
CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA  8220
AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT  8280
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT  8340
GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT  8400
GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA  8460
TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT  8520
TTCCCCGAAA AGTGCCACCT  8540

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9209 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG  60
AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC  120
GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT  180
ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG  240
GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT  300
GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT  360
AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC  420
GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG  480
ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA  540
TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA  600
AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC  660
ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC  720
ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT ACCGGTTTGA CTCACGCGGA  780
TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG  840
GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA  900
CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC  960
CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT  1020
GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT  1080
GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC  1140
CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC  1200
CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC  1260
ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT  1320
AACCCACGCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA  1380
TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG  1440
TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC  1500
CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC  1560
AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC  1620
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG  1680
TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC  1740
GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT  1800
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC  1860
TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT  1920
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT  1980
GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG  2040
ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT  2100
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT  2160
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC  2220
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC  2280
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA  2340
TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC  2400
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG  2460
GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC  2520
TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT  2580
GGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT  2640
CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCAGCAC AGCCTACATG  2700
CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC  2760
TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCTCTGCA  2820
GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG  2880
GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG  2940
TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA  3000
GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC  3060
TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGCAGAGCCC  3120
AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA  3180
CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT  3240
GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG  3300
TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC  3360
AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG  3420
GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC  3480
AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC TGCCCCCATC CCGGGATGAG  3540
CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG GCTTCTATCC CAGCGACATC  3600
GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG  3660
CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG  3720
CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGCACAA CCACTACACG  3780
CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGAGGATCCG TTAACGGTTA CCAACTACCT  3840
AGACTGGATT CGTGACAACA TGCGGCCGTG ATATCTACGT ATGATCAGCC TCGACTGTGC  3900
CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG  3960
GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA  4020
GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG  4080
ACAATAGCAG GCATGCTGGG GATGCGGTGG GCTCTATGGA ACCAGCTGGG GCTCGACAGC  4140
GCTGGATCTC CCGATCCCCA GCTTTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA  4200
AGGAAAATTA ATTTTAACAC CAATTCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG  4260
ATGCTTTAGA GACAGTGTTC TCTGCACAGA TAAGGACAAA CATTATTCAG AGGGAGTACC  4320
CAGAGCTGAG ACTCCTAAGC CAGTGAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC  4380
ATCACCGAAG CCTGATTCCG TAGAGCCACA CCTTGGTAAG GGCCAATCTG CTCACACAGG  4440
ATAGAGAGGG CAGGAGCCAG GGCAGAGCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA  4500
CATTTGCTTC TGACATAGTT GTGTTGGGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG  4560
ATTTCGCGCC AAACTTGACG GCAATCCTAG CGTGAAGGCT GGTAGGATTT TATCCCCGCT  4620
GCCATCATGG TTCGACCATT GAACTGCATC GTCGCCGTGT CCCAAAATAT GGGGATTGGC  4680



AAGAACGGAG ACCTACCCTG GCCTCCGCTC AGGAACGAGT TCAAGTACTT CCAAAGAATG  4740
ACCACAACCT CTTCAGTGGA AGGTAAACAG AATCTGGTGA TTATGGGTAG GAAAACCTGG  4800
TTCTCCATTC CTGAGAAGAA TCGACCTTTA AAGGACAGAA TTAATATAGT TCTCAGTAGA  4860
GAACTCAAAG AACCACCACG AGGAGCTCAT TTTCTTGCCA AAAGTTTGGA TGATGCCTTA  4920
AGACTTATTG AACAACCGGA ATTGGCAAGT AAAGTAGACA TGGTTTGGAT AGTCGGAGGC  4980
AGTTCTGTTT ACCAGGAAGC CATGAATCAA CCAGGCCACC TTAGACTCTT TGTGACAAGG  5040
ATCATGCAGG AATTTGAAAG TGACACGTTT TTCCCAGAAA TTGATTTGGG GAAATATAAA  5100
CTTCTCCCAG AATACCCAGG CGTCCTCTCT GAGGTCCAGG AGGAAAAAGG CATCAAGTAT  5160
AAGTTTGAAG TCTACGAGAA GAAAGACTAA CAGGAAGATG CTTTCAAGTT CTCTGCTCCC  5220
CTCCTAAAGC TATGCATTTT TATAAGACCA TGGGACTTTT GCTGGCTTTA GATCAGCCTC  5280
GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC  5340
CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG  5400
TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA  5460
TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGCGGTGGGC TCTATGGAAC CAGCTGGGGC  5520
TCGAGCTACT AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT  5580
AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG  5640
AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA  5700
GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA  5760
GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG  5820
GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT  5880
CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCTCTATG GTTGAACAAG ATGGATTGCA  5940
CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG CACAACAGAC  6000
AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC CGGTTCTTTT  6060
TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG CGCGGCTATC  6120
GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA CTGAAGCGGG  6180
AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT CTCACCTTGC  6240
TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA CGCTTGATCC  6300
GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC GTACTCGGAT  6360
GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC TCGCGCCAGC  6420
CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG TCGTGACCCA  6480
TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG GATTCATCGA  6540
CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA CCCGTGATAT  6600
TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG GTATCGCCGC  6660
TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT GAGCGGGACT  6720
CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA TTTCGATTCC  6780
ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC CGGCTGGATG  6840
ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC ACCCCAACTT GTTTATTGCA  6900
GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT  6960
TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATC TATCTTATCA TGTCTGGATC  7020
GCGGCCGCGA TCCCGTCGAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT CCTGTGTGAA  7080
ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAAGCCT  7140
GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC  7200
AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG  7260
GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC TCGGTCGTTC  7320
GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC ACAGAATCAG  7380
GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA  7440
AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC  7500
GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC  7560
CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG  7620
CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG TATCTCAGTT  7680
CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC  7740
GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC  7800
CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG  7860
AGTTCTTGAA GTGGTGGCCT AACTACGGCT AGACTAGAAG GACAGTATTT GGTATCTGCG  7920
CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA  7980
CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG  8040
GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT  8100
CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA  8160
ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT  8220
ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG  8280
TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA  8340
GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC  8400
AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT  8460
CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG  8520
TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA  8580
GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG  8640
TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA  8700
TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG  8760



TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT  8820
CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA  8880
TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA  8940
GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG  9000
TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC  9060
GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT  9120
ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC  9180
CGCGCACATT TCCCCGAAAA GTGCCACCT  9209

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
ATCACAGATC TCTCACCATG GATTTTCAGG TGCAGATTAT CAGCTTC  47

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
TGCAGCATCC GTACGTTTGA TTTCCAGCTT  30

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 384 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..384

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 67..384

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA  48
Met Asp Phe Gln Val Gln Ile Ile Ser Phe Leu Leu Ile Ser Ala Ser
-22     -20                 -15                 -10

GTC ATA ATG TCC AGA GGG CAA ATT GTT CTC TCC CAG TCT CCA GCA ATC  96
Val Ile Met Ser Arg Gly Gln Ile Val Leu Ser Gln Ser Pro Ala Ile
    -5                  1               5                   10

CTG TCT GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT TGC AGG GCC AGC  144
Leu Ser Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Arg Ala Ser
                15                  20                  25

TCA AGT GTA AGT TAC ATC CAC TGG TTC CAG CAG AAG CCA GGA TCC TCC  192
Ser Ser Val Ser Tyr Ile His Trp Phe Gln Gln Lys Pro Gly Ser Ser
            30                  35                  40

CCC AAA CCC TGG ATT TAT GCC ACA TCC AAC CTG GCT TCT GGA GTC CCT  240
Pro Lys Pro Trp Ile Tyr Ala Thr Ser Asn Leu Ala Ser Gly Val Pro
        45                  50                  55

GTT CGC TTC AGT GGC AGT GGG TCT GGG ACT TCT TAC TCT CTC ACA ATC  288
Val Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile
    60                  65                  70

AGC AGA GTG GAG GCT GAA GAT GCT GCC ACT TAT TAC TGC CAG CAG TGG  336
Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp
75                  80                  85                  90

ACT AGT AAC CCA CCC ACG TTC GGA GGG GGG ACC AAG CTG GAA ATC AAA  384
Thr Ser Asn Pro Pro Thr Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys
                95                  100                 105

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
GCGGCTCCCA CGCGTGTCCT GTCCCAG  27

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: YES

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 3
(D) OTHER INFORMATION: /note= "Nucleotide 3 is N wherein N is G or C."

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 19
(D) OTHER INFORMATION: /note= "Nucleotide 19 is N wherein N is A or G."

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 18
(D) OTHER INFORMATION: /note= "Nucleotide 18 is N wherein N is A or C."

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 25
(D) OTHER INFORMATION: /note= "Nucleotide 25 is N wherein N is G or A."

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
GGNTGTTGTG CTAGCTGNNG AGACNGTGA  29
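The four degenerate positions of SEQ ID NO:8 each admit two bases, so the primer stands for 2 x 2 x 2 x 2 = 16 concrete oligonucleotides. The following is a minimal Python sketch of that expansion and is not part of the original listing; the function name `expand_degenerate` is hypothetical, and the allowed bases per position are taken from the feature notes above as read from the listing (positions are 1-based).

```python
from itertools import product

# SEQ ID NO:8 with spaces removed; N marks the degenerate positions.
PRIMER = "GGNTGTTGTGCTAGCTGNNGAGACNGTGA"

# 1-based position -> allowed bases, as stated in the misc_feature notes.
ALLOWED = {3: "GC", 18: "AC", 19: "AG", 25: "GA"}


def expand_degenerate(primer: str, allowed: dict) -> list:
    """Return every concrete sequence represented by the degenerate primer."""
    positions = sorted(allowed)
    variants = []
    for combo in product(*(allowed[p] for p in positions)):
        bases = list(primer)
        for pos, base in zip(positions, combo):
            if bases[pos - 1] != "N":
                raise ValueError(f"position {pos} is not degenerate")
            bases[pos - 1] = base
        variants.append("".join(bases))
    return variants


variants = expand_degenerate(PRIMER, ALLOWED)
assert len(variants) == 16
```

Raising an error when a noted position does not hold N guards against an off-by-one between the feature table and the sequence line.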



(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 420 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..420

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 58..420

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT  48
Met Gly Trp Ser Leu Ile Leu Leu Phe Leu Val Ala Val Ala Thr Arg
-19             -15                 -10                 -5

GTC CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG  96
Val Leu Ser Gln Val Gln Leu Gln Gln Pro Gly Ala Glu Leu Val Lys
            1               5                   10

CCT GGG GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT  144
Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe
    15                  20                  25

ACC AGT TAC AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG  192
Thr Ser Tyr Asn Met His Trp Val Lys Gln Thr Pro Gly Arg Gly Leu
30                  35                  40                  45

GAA TGG ATT GGA GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT  240
Glu Trp Ile Gly Ala Ile Tyr Pro Gly Asn Gly Asp Thr Ser Tyr Asn
                50                  55                  60

CAG AAG TTC AAA GGC AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC  288
Gln Lys Phe Lys Gly Lys Ala Thr Leu Thr Ala Asp Lys Ser Ser Ser
            65                  70                  75

ACA GCC TAC ATG CAG CTC AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC  336
Thr Ala Tyr Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val
        80                  85                  90

TAT TAC TGT GCA AGA TCG ACT TAC TAC GGC GGT GAC TGG TAC TTC AAT  384
Tyr Tyr Cys Ala Arg Ser Thr Tyr Tyr Gly Gly Asp Trp Tyr Phe Asn
    95                  100                 105

GTC TGG GGC GCA GGG ACC ACG GTC ACC GTC TCT GCA  420
Val Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ala
110                 115                 120
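The amino acid line under each codon row follows from the standard genetic code, so a CDS row can be checked mechanically. The following is a minimal Python sketch, not part of the original listing: the function name `translate` is hypothetical, the table is the standard genetic code, and one-letter amino acid codes are used rather than the three-letter codes of the listing.

```python
# Standard genetic code, built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame, ignoring spaces between codons."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First codon row of SEQ ID NO:9 (nucleotides 1 to 48).
first_row = "ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT"
assert translate(first_row) == "MGWSLILLFLVAVATR"
```

The result corresponds to Met Gly Trp Ser Leu Ile Leu Leu Phe Leu Val Ala Val Ala Thr Arg, the first sixteen residues of the leader sequence.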